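On feature dimensionality reduction and selection: most EDA recipes judge feature importance from the loss of a trained model. A simple and very effective alternative is to do feature permutation inside CV: shuffle each original feature to get a "shadow" copy (you can also add some noise), then compare the importance of the original against its shadow via a z-score, and keep iterating to filter features. ESL II (The Elements of Statistical Learning, 2nd ed.) mentions this approach on page 593. In R the Boruta package does this, and there is a Python version too: https://github.com/scikit-learn-contrib/boruta_py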
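A minimal sketch of the shadow-feature idea, using only the Python standard library. This is not Boruta's actual algorithm or the boruta_py API: the importance score here is absolute Pearson correlation with the target (an assumption chosen to keep the example dependency-free; Boruta uses random forest importances), the CV loop is left out, and the names `abs_corr`, `shadow_z_scores`, and the z > 3 cutoff are hypothetical.

```python
import random
import statistics


def abs_corr(xs, ys):
    """Absolute Pearson correlation, a stand-in importance score (assumption)."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return abs(cov / (sx * sy)) if sx > 0 and sy > 0 else 0.0


def shadow_z_scores(columns, target, n_rounds=200, seed=0):
    """Z-score each feature's importance against its shuffled (shadow) copies."""
    rng = random.Random(seed)
    z = {}
    for name, values in columns.items():
        real = abs_corr(values, target)
        shadow_scores = []
        for _ in range(n_rounds):
            shadow = values[:]      # copy the column
            rng.shuffle(shadow)     # shuffling breaks any link to the target
            shadow_scores.append(abs_corr(shadow, target))
        mu = statistics.fmean(shadow_scores)
        sigma = statistics.stdev(shadow_scores)
        z[name] = (real - mu) / sigma if sigma > 0 else 0.0
    return z


# Toy data: "signal" drives the target, "noise" is unrelated to it.
rng = random.Random(42)
n = 300
signal = [rng.gauss(0, 1) for _ in range(n)]
noise = [rng.gauss(0, 1) for _ in range(n)]
target = [2.0 * s + rng.gauss(0, 0.5) for s in signal]

z = shadow_z_scores({"signal": signal, "noise": noise}, target)
# Keep features whose real importance sits far above the shadow distribution.
selected = [name for name, score in z.items() if score > 3.0]
print(z, selected)
```

In practice you would swap `abs_corr` for a model-based importance, compute it on held-out CV folds, and repeat the filter over several rounds, which is roughly what the Boruta packages automate.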

Forwarded from Data Science Archive (小熊猫)

GitHub - scikit-learn-contrib/boruta_py: Python implementations of the Boruta all-relevant feature selection method.

1.8K views, 小熊猫, Jan 26, 2022 at 05:46

tg-me.com/DataScienceArchive/114

Create: 2022-01-26
Last Update: 2025-06-26 11:55:16
